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Clostridium jeddahense strain JCD T (= CSUR P693 = DSM 27834) is the type strain of C. 
jeddahense sp. nov. This strain, whose genome is described here, was isolated from the fecal 
flora of an obese 24 year-old Saudian male (BMI=52 kg/m 2 ). Clostridium jeddahense strain 
JCD T is an obligate Gram-positive bacillus. Here we describe the features of this organism, 
together with the complete genome sequence and annotation. The 3,613,503 bp long ge- 
nome (1 chromosome, no plasmid) exhibits a G+C content of 51.95% and contains 3,462 
protein-coding and 53 RNA genes, including 4 rRNA genes. 



Introduction 

Clostridium jeddahense strain JCD T (=CSUR P693 = 
DSM 27834), is the type strain of Clostridium 
jeddahense sp. nov. This bacterium is a Gram- 
positive, anaerobic, spore-forming indole, positive 
bacillus that was isolated from the stool of an 
obese 24 year-old Saudian individual, as a part of 
a culturomics study as previously reported. 

The usual parameters used to delineate a bacterial 
species include 16S rDNA sequence identity and 
phylogeny [1,2], genomic G + C content diversity, 
and DNA-DNA hybridization (DDH) [3,4]. Never- 
theless, some limitations appeared notably be- 
cause the cutoff values vary dramatically between 
species and genera [5]. The introduction of high- 
throughput sequencing techniques made genomic 
data for many bacterial species available [6]. We 
recently proposed a new method (taxono- 
genomics), which includes genomic data in a 
polyphasic approach to describe new bacterial 



species [6]. This strategy combines phenotypic 
characteristics, including MALDI-T0F MS spec- 
trum, and genomic analysis [7-37]. 

Here, we present a summary classification and a 
set of features for C. jeddahense sp. nov. strain 
JCDT (=CSUR P693 = DSM 27834), together with 
the description of the complete genome sequenc- 
ing and annotation. These characteristics support 
the circumscription of the species C. jeddahense. 

The genus Clostridium was created in 1880 [38] 
and consists of obligate anaerobic rod-shaped ba- 
cilli able to produce endospores [38]. More than 
200 species have been described to date 
(http : / / www.bacterio.cict.fr /c/ clostridium.html) . 
Members of the genus Clostridium are mostly en- 
vironmental bacteria or associated with the com- 
mensal digestive flora of mammals. However, sev- 
eral are major human pathogens, including C. 
botulinum, C. difficile and C. tetani [38]. 
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Classification and features 

A stool sample was collected from an obese 24- 
year-old male Saudian volunteer patient living in 
Jeddah. The patient gave an informed and signed 
consent, and the agreement of the Ethical Commit- 
tee of the King Abdulaziz University, King Fahd 
medical Research Centre, Saudi Arabia, and the lo- 
cal ethics committee of the IFR48 (Marseille, 
France) were obtained under agreement number 
014-CEGMR-2-ETH-P and 09-022 respectively. 
The fecal specimen was preserved at -80°C after 
collection and sent to Marseille. Strain JCD T (Table 
1) was isolated in July 2013 by anaerobic cultiva- 
tion on 5% sheep blood-enriched Columbia agar 



(BioMerieux, Marcy l'Etoile, France) after a 5-day 
preincubation on blood culture bottle with rumen 
fluid. This strain exhibited a 97.3% nucleotide se- 
quence similarity with Clostridium sporosphae- 
roides strain DSM 1294 (Figure 1). This value was 
lower than the 98.7% 16S rRNA gene sequence 
similarity threshold recommended by 
Stackebrandt and Ebers to delineate a new species 
without carrying out DNA-DNA hybridization [2] 
and was in the 78. 4 to 98.9% range of 16S rRNA 
identity values observed among 41 Clostridium 
species with validly published names [52]. 



Table 1. Classification and general features of Clostridium jeddahense strain JCD T according to the MIGS recom- 
mendations [39] 



MIGS ID 


Property 


Term 


Evidence code 3 




Current classification 


Domain Bacteria 


TAS [40] 






Phylum Firmicutes 


TAS [41-43] 






Class Clostridia 


TAS [44,45] 






Order Clostridials 


TAS [46,47] 






Family Clostridiaceae 


TAS [46,48] 






Genus Clostridium 


IDA [46,49,50] 






Species Clostridium jeddahense 


IDA 






Type strain JCD T 


IDA 




Gram stain 


Positive 


IDA 




Cell shape 


Rod 


IDA 




Motility 


Motile 


IDA 




Sporulation 


Sporulating 


IDA 




Temperature range 


Mesophile 


IDA 




Optimum temperature 


37°C 


IDA 


MIGS-6.3 


Salinity 


Unknown 


IDA 


MIGS-22 


Oxygen requirement 


Anaerobic 


IDA 




Carbon source 


Unknown 


IDA 




Energy source 


Unknown 


IDA 


MIGS-6 


Habitat 


Human gut 


IDA 


MIGS-15 


Biotic relationship 


Free living 


IDA 




Pathogenicity 


Unknown 






Biosafety level 


2 




MIGS-14 


Isolation 


Human feces 




MIGS-4 


Geographic location 


Jeddah, Saudi Arabia 


IDA 


MIGS-5 


Sample collection time 


July 2013 


IDA 


MIGS-4.1 


Latitude 


21.422487 


IDA 


MIGS-4.1 


Longitude 


39.856184 


IDA 


MIGS-4.3 


Depth 


Surface 


IDA 


MIGS-4.4 


Altitude 


0 m above sea level 


IDA 



Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists 
in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sam- 
ple, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes 
are from the Gene Ontology project [51]. If the evidence is IDA, then the property was directly observed for a 
live isolate by one of the authors or an expert mentioned in the acknowledgements. 
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Figure 1. A consensus phylogenetic tree highlighting the position of Clostridium jeddahense strain JCD T rela- 
tive to other type strains within the Clostridum genus. GenBank accession numbers are indicated in parenthe- 
ses. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using the maxi- 
mum-likelihood method in the MEGA software package. Numbers at the nodes are the percentages of boot- 
strap values from 500 replicates that support the node. Clostridium ramosum was used as outgroup. The scale 
bar represents a 2% nucleotide sequence divergence. 



Four growth temperatures (25, 30, 37, 45°C) were 
tested; growth occurred between 25 and 37°C, but 
optimal growth was observed at 37°C, 24 hours af- 
ter inoculation. No growth occurred at 45°C. Colo- 
nies were translucent and approximately 0.2 to 
0.3 mm in diameter on 5% sheep blood-enriched 
Columbia agar (BioMerieux). Growth of the strain 
was tested on the same agar under anaerobic and 
microaerophilic conditions using GENbag anaer 
and GENbag microaer systems, respectively 
(BioMerieux), and in aerobic conditions, with or 
without 5% CO2. Growth was observed only an- 
aerobically. No growth occurred in aerobic or 
microaerophilic conditions. Gram staining showed 
Gram-positive rods able to form spores (Figure 2). 
A motility test was positive. Cells grown on agar 
exhibit a mean diameter of 1 urn and a mean 
length of 1.22 urn in electron microscopy (Figure 
3). 



Strain JCD T exhibited neither catalase nor oxidase 
activity (Table 2). Using an API Rapid ID 32A strip 
(BioMerieux), positive reactions were obtained for 
indole production, alkaline phosphatase, arginine 
arylamidase, proline arylamidase, alanine aryl- 
amidase, glycine arylamidase, histidine aryl- 
amidase, glutamyl glutamic acid arylamidase and 
serine arylamidase. Negative reactions were ob- 
tained for arginine dihydrolase, a-galactosidase, (B- 
galactosidase, a-glucosidase, (B-glucosidase, a- 
arabinosidase, N-acetyl-(B-glucosaminidase, glu- 
tamic acid decarboxylase, a-fucosidase, nitrate re- 
duction, leucyl glycine arylamidase, fermentation 
of mannose and raffinose, urease, (B-galactosidase- 
6-phosphatase, (B-glucuronidase, phenylalanine 
arylamidase, leucine arylamidase, pyroglutamic 
acid arylamidase and tyrosine arylamidase. Using 
an API 50CH strip (Biomerieux), strain JCD T was 
asaccharolytic. 
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C. jeddahense is susceptible to amoxicillin, amoxi- ceftriaxone, ciprofloxacin and trimethoprim- 

cillin-clavulanate, imipenem, metronidazole, doxy- sulfamethoxazole. The comparisons with other 

cycline, rifampicin, vancomycin but resistant to Clostridium species are summarized in Table 2. 




500 nm 



Figure 3. Transmission electron micrograph of C. jeddahense strain 
JCD T , taken using a Morgani 268D (Philips) at an operating voltage of 
60kV.The scale bar represents 500 nm. 
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Matrix-assisted laser-desorption/ionization time- 
of-flight (MALDI-TOF) MS protein analysis was 
carried out as previously described [54]. Briefly, a 
pipette tip was used to pick one isolated bacterial 
colony from a culture agar plate and spread it as a 
thin film on a MTP 384 MALDI-TOF target plate 
(Bruker Daltonics, Leipzig, Germany). Twelve dis- 
tinct deposits from twelve isolated colonies were 
performed for strain JCD T . Each smear was over- 
laid with 2 |iL of matrix solution (saturated solu- 
tion of alpha-cyano-4-hydroxycinnamic acid] in 50% 
acetonitrile, 2.5% tri-fluoracetic acid, and allowed 
to dry for 5 minutes. Measurements were per- 
formed with a Microflex spectrometer [Bruker]. 
Spectra were recorded in the positive linear mode 
for the mass range of 2,000 to 20,000 Da (parame- 
ter settings: ion source 1 [ISI], 20kV; IS2, 18.5 kV; 
lens, 7 kV). A spectrum was obtained after 675 
shots with variable laser power. The time of ac- 
quisition was between 30 seconds and 1 minute 
per spot. The twelve JCD T spectra were imported 



into the MALDI BioTyper software (version 2.0, 
Bruker) and analyzed by standard pattern match- 
ing (with default parameter settings] against the 
main spectra of 3,769 bacteria, including 228 
spectra from 96 Clostridium species. The method 
of identification included the m/z from 3,000 to 
15,000 Da. For every spectrum, a maximum of 100 
peaks were compared with spectra in database. 
The resulting score enabled the identification of 
tested species, or not: a score 0 2 with a validly 
published species enabled identification at the 
species level, a score 0 1.7 but < 2 enabled identi- 
fication at the genus level, and a score < 1.7 did 
not enable any identification. No significant 
MALDI-TOF score was obtained for strain JCD T 
against the Bruker database, suggesting that our 
isolate was not a member of a known species. We 
added the spectrum from strain JCD T to our data- 
base (Figure 4). Finally, the gel view showed the 
spectral differences with other members of the 
genus Clostridium (Figure 5). 



! xio 4 ' 




2000 4000 6000 8000 10000 12000 14000 16000 18000 

m/z 

Figure 4. Reference mass spectrum from C. jeddahense strain JCD T . Spectra from 12 individual colo- 
nies were compared and a reference spectrum was generated. 
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Clostridium sporosphaeroides 
DSM 1294TVML 



Clostridium senega lense 

DSM 25507 



Clostridium jeddahense 
DSM 27S34 



Clostridium difficile 
DSM 12057 



Clostridium dakarense 
DSM 27086 



Clostridium beijerinckii 
1011 DSM 552 BOG 



Speclrum 




Figure 5: Gel view comparing C. jeddahense strain JCD T to other Clostridium species. The gel view displays the 
raw spectra of loaded spectrum files arranged as a pseudo-electrophoretic gel. The x-axis records the m/z value. 
The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak in- 
tensity is expressed by a grey scale scheme code. The grey scale bar on the right y-axis indicates the relation 
between the shade of grey a peak is displayed with and the peak intensity in arbitrary units. Species names are 
shown on the left. 



Genome sequencing information 

Genome project history 

The organism was selected for sequencing on the 
basis of its phylogenetic position and 16S rDNA 
similarity to members of the genus Clostridium, 
and is part of a study of the human digestive flora 
aiming at isolating all bacterial species in human 
feces [55]. It was the 101 st genome of a Clostridium 



species and the first genome of C. jeddahense sp. 
nov. The GenBank accession number is 
CBYL00000000. The assembly consists of 104 
contigs. Table 3 shows the project information and 
its association with MIGS version 2.0 compliance 
[39]. 



Table 3. Project information 



MIGS ID 


Property 


Term 


MIGS-31 


Finishing quality 


High-quality draft 


MIGS-28 


Libraries used 


Paired end and Mate pair 


MIGS-29 


Sequencing platform 


MySeq lllumina 


MIGS-31. 2 


Fold coverage 


94.91 x 


MIGS-30 


Assemblers 


Newbler 


MIGS-32 


Gene calling method 
Genbank Date of Re- 


PRODIGAL 




lease 


February 12, 2014 




Genbank project ID 


CBYL00000000 


MIGS-13 


Project relevance 


Study of the human gut microbiome 
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Growth conditions and DNA isolation 

C. jeddahense sp. nov., strain JCD T (= CSUR P693 = 
DSM 27834) was grown on 5% sheep blood- 
enriched Columbia agar (BioMerieux) at 37°C in 
anaerobic atmosphere. Bacteria grown on three 
Petri dishes were harvested and resuspended in 
4xl00nL of TE buffer. Then, 200 \iL of this suspen- 
sion was diluted in 1ml TE buffer for lysis treat- 
ment that included a 30- minute incubation with 
2.5 |ig/|iL lysozyme at 37°C, followed by an over- 
night incubation with 20 ng/M-L proteinase K at 
37°C. Extracted DNA was then purified using 3 
successive phenol-chloroform extractions and 
ethanol precipitation at -20°C overnight. After 
centrifugation, the DNA was resuspended in 160 
|iL TE buffer. 

Genome sequencing and assembly 

Genomic DNA of Clostridium jeddahense was se- 
quenced on a MiSeq sequencer (Illumina, Inc, San 
Diego CA 92121, USA) with 2 applications: paired 
end and mate pair. The paired end and the mate 
pair strategies were barcoded in order to be 
mixed respectively with 14 other genomic pro- 
jects constructed according the Nextera XT library 
kit (Illumina) and 11 others projects with the 
nextera Mate pair kit (Illumina). 

The gDNA was quantified by a Qubit assay with 
the high sensitivity kit (Life technologies, Carlsbad, 
CA, USA) to 11.1 ng/uL and dilution was per- 
formed such that lng of each strain's gDNA was 
used to construct the paired end library. The 
"tagmentation" step fragmented and tagged the 
DNA .Then limited cycle PCR amplification com- 
pleted the tag adapters and introduced dual-index 
barcodes. After purification on Ampure beads 
(Life Technolgies, Carlsbad, CA, USA), the libraries 
were normalized on specific beads according to 
the Nextera XT protocol (Illumina). Normalized li- 
braries are pooled into a single library for se- 
quencing on the MiSeq. The pooled single strand 
library was loaded onto the reagent cartridge and 
then onto the instrument along with the flow cell. 
Automated cluster generation and paired-end se- 
quencing with dual index reads was performed in 
a single 39-hour run at a 2x250 bp read length. 
Within this pooled run, the index representation 
was determined to be 7.3%. Total information of 
5.3 Gbases was obtained from a 574 K/mm2 den- 
sity with 95.4% (11,188,000 clusters) of the clus- 
ters passing quality control (QC) filters. From the 
genome sequencing process, the 753,292 pro- 



duced Illumina reads for Clostridium jeddahense 
were filtered according to the read qualities. 

The mate pair library was constructed from 1 |ig 
of genomic DNA using the Nextera Mate Pair 
Illumina guide. The genomic DNA sample is simul- 
taneously fragmented and tagged with a mate pair 
junction adapter. The profile of the fragmentation 
was validated on an Agilent 2100 BioAnalyzer 
(Agilent Technologies, Inc., Santa Clara, CA, USA) 
with a DNA7500 labchip. The DNA fragments 
range in size from 1 kb up to 11 kb with a mean 
size of 7kb. No size selection was performed and 
600 ng tagmented fragments were circularized. 
The larger circularized DNA molecules were phys- 
ically sheared to smaller sized fragments with a 
mean size of 625 bp on the Covaris device S2 in 
microtubes (Woburn, MA, USA) .The library's pro- 
file and the quantitation were visualized on a High 
Sensitivity Bioanalyzer LabChip. The libraries 
were normalized to 2 nM and pooled. After a de- 
naturation step and dilution at 10 pM the pool of 
libraries was loaded onto the reagent cartridge 
and then onto the instrument along with the flow 
cell. Automated cluster generation and sequencing 
run was performed in a single 39-hour run at a 
2x250 bp read length. 

Total information of 3.9 Gb was obtained from a 
399 K/mm2 density with 97.9% (7,840,000 clus- 
ters) of the clusters passing quality control (QC) 
filters. Within this pooled run, the index represen- 
tation for Clostridium jeddahense was determined 
to be 6.54%. 

From this genome sequencing process, the 
501,426 produced Illumina reads for Clostridium 
jeddahense were filtered according to the read 
qualities. 

Genome annotation 

Open Reading Frames (ORFs) were predicted us- 
ing Prodigal [56] with default parameters. How- 
ever, the predicted ORFs were excluded if they 
spanned a sequencing gap region. The predicted 
bacterial protein sequences were searched against 
the GenBank [57] and Clusters of Orthologous 
Groups (COG) databases using BLASTP. The tRNAs 
and rRNAs were predicted using the tRNAScan-SE 
[58] and RNAmmer [59] tools, respectively. Signal 
peptides and numbers of transmembrane helices 
were predicted using SignalP [60] and TMHMM 
[61], respectively. Mobile genetic elements were 
predicted using PHAST [62] and RAST [63]. 
ORFans were identified if their BLASTP £-value 
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was lower than le-03 for alignment length greater 
than 80 amino acids. If alignment lengths were 
smaller than 80 amino acids, we used an £-value 
of le-05. Such parameter thresholds have already 
been used in previous work to define ORFans. Ar- 
temis [64] and DNA Plotter [65] were used for da- 
ta management and visualization of genomic fea- 
tures, respectively. The Mauve alignment tool 
(version 2.3.1) was used for multiple genomic se- 
quence alignment [66]. 

To estimate the mean level of nucleotide se- 
quence similarity at the genome level between C. 
jeddahense and 7 other members of the genus 
Clostridium, we used the Average Genomic Identi- 
ty Of gene Sequences (AGIOS) home-made soft- 
ware [6]. Briefly, this software combines the 
Proteinortho software [67] for detecting ortholo- 
gous proteins between pairs of genomes, then re- 
trieves the corresponding genes and determines 
the mean percentage of nucleotide sequence iden- 
tity among orthologous ORFs using the Needle- 



man-Wunsch global alignment algorithm. C. 
jeddahense strain JCD T was compared to C. 
senegalense strain JC122, C. dakarense strain FF1, 
Clostridium beijerinckii strain NCIMB 8052, C. 
difficile strain Bl, Clostridium cellulolyticum strain 
H10, Clostridium leptum strain DSM 753, and Clos- 
tridium sporosphaeroides strain DSM 1294 (see 
Table 6B). 

Genome properties 

The genome is 3,613,503 bp long (1 chromosome, 
but no plasmid) with a 51.95% G+C content (Fig- 
ure 6 and Table 4). Of the 3,515 predicted genes, 
3,462 were protein-coding genes and 53 were 
RNAs, including 4 rRNAs. A total of 2,193 genes 
(62.38%) were assigned a putative function and 
81 genes were identified as ORFans (2.3%). The 
properties and statistics of the genome are sum- 
marized in Tables 4 and 5. The distribution of 
genes into COG functional categories is presented 
in Table 5. 



3600000 
i i 




1800000 



Figure 6. Graphical circular map of the chromosome. From the outside in: open reading frames 
oriented in the forward (colored by COG categories) direction, open reading frames oriented in 
the reverse (colored by COG categories) direction, RNA operon (red), and tRNAs (green), GC 
content plot, and GC skew (purple: negative values, olive: positive values). 
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Table 4. Nucleotide content and gene count levels of the genome 




Attribute 




Value % of total 3 




Genome size (bp) 




3,613,503 




DNA G+C content (bp) 




1,877,214 51.95 




DNA Coding region (bp) 




3,152,277 87.23 




Number of replicons 




1 




Extra chromosomal element 


0 




Total genes 




3,515 100 




RNA genes 




53 1.51 




Protein-coding genes 




3,462 98.49 




Genes with function prediction 


2,193 87.19 




Genes assigned to COGs 




2,515 71.55 




Genes with peptide signals 


135 3.84 




Genes with transmembrane helices 


887 25.23 




a The total is based on eith< 


er the size 


of the genome in base pairs or the total number of 




protein-coding genes in the annotated 


genome 


Table 5. Number of genes associated 


with the 25 general COG functional categories 


Code 


Value 


% age 3 


Description 


J 


154 


4.45 


Translation 


A 


0 


0 


RNA processing and modification 


K 


296 


8.55 


Transcription 


L 


138 


3.98 


Replication, recombination and repair 


B 


1 


0.03 


Chromatin structure and dynamics 


D 


24 


0.69 


Cell cycle control, mitosis and meiosis 


Y 


0 


0 


Nuclear structure 


V 

V 


/ _> 


7 1 1 


Dp>fp>ncp* mprh^nicmc 


T 
1 


1 -J u 


A 5 


Ciona \ttx ncni \c^\c\x\ mprhanicinc 

JltMldl LI dl I5U ULAI Ul 1 1 1 ltrL,l Idl 1 IM 1 13 


M 

/VI 




J.JJ 


p>ll \a^3 /mpmhranp ninopnpcic 
v^,cll vv d 1 1/ 1 1 Itrl 1 IUI d 1 It \J lutltrl ICS Id 


M 

IN 


DZ 


1 70 

i ./y 


Cell motility 


Z 


0 


0 


Cytoskeleton 


W 


0 


0 


Extracellular structures 


u 


48 


1. 38 


Intracellular trafficking and secretion 








Posttranslational modification, protein turnover, chaper- 


o 


66 


1.9 


ones 


c 


154 


4.45 


Energy production and conversion 


G 


237 


6.84 


Carbohydrate transport and metabolism 


E 


328 


9.47 


Amino acid transport and metabolism 


F 


56 


1.61 


Nucleotide transport and metabolism 


H 


92 


2.66 


Coenzyme transport and metabolism 


I 


85 


2.45 


Lipid transport and metabolism 


P 


164 


4.74 


Inorganic ion transport and metabolism 








Secondary metabolites biosynthesis, transport and catab- 


Q 


53 


1.53 


olism 


R 


346 


10 


General function prediction only 


S 


195 


5.63 


Function unknown 




947 


27.35 


Not in COGs 



a The total is based on the total number of protein-coding genes in the annotated genome 
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Genome comparison with other Clos- 
tridium genomes 

We compared the genomes of C.jeddahense JCD T , C. 
sporosphaeroides DSM 1294, C. leptum DSM 753, C. 
beijerincki NCIMB 8052, C. cellulolyticum H10, C. 
difficile Bl, C. senegalense DSM 25507, C. 
dakarense DSM 27086 (Table 6A). 

The draft genome of C. jeddahense (3.61 Mb] is 
larger than C. sporosphaeroides and C. leptum 
(3.17 and 3.27 Mb respectively] but smaller than C. 
beijerincki, C. cellulolyticum, C. difficile, C. 
senegalense and C. dakarense (6.0, 4.07, 4.46, 3.89, 
3.73 Mb respectively]. It exhibits a higher G+C 
content than all other compared genome except C. 
sporosphaeroides (53.5%]. C. jeddahense has a 
higher gene content (3,462] than C. 
sporosphaeroides, C. difficile (2,951 and 3,390 re- 
spectively] but smaller than C. leptum, C. beijer- 
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incki, C. cellulolyticum, C. senegalense and C. dakar- 
ense (3,591, 3,818, 3,923, 3,704, 5,020 respective- 
ly]. C. jeddahense shared 1,573, 876, 816, 847, 
1,030, 770 and 1,044 orthologous genes with C. 
sporosphaeroides,C. cellulolyticum, C. dakarense, C. 
difficile, C. leptum, C. senegalense and C. beijerincki 
respectively. 

When we compared C. jeddahense with other spe- 
cies, AGIOS values ranged from 57.52 with C. 
senegalense to 91.97% with C. sporosphaeroides. 
Although the AGIOS value was elevated between C. 
jeddahense and C. sporosphaeroides, we believe 
that the remarkable phenotypic differences, in- 
cluding motility, indole production (Table 2], and 
protein profile (Figure 7], enable the classification 
of C. jeddahense as a new species. 



Table 6A. Genomic comparison of C. jeddahense with 7 other Clostridium species + . 

Genome accession Genome size 
Species Strain number (Mb) G+C content 



C. jeddahense 
C. sporosphaeroides 
C. cellulolyticum 
C. dakarense 



JCD T 



DSM 1294 



H10 



CBYL00000000 



ARTA01 000000 



NC 011898 



DSM 27086 CBTZ01 0000000 



3.61 
3.17 
4.07 
3.73 



51.95 
53.5 
37.4 

27.98 



C. difficile 



B1 



NC 017179 



4.46 



28.4 



C. leptum 



DSM 753 



ABCB02000000 



3.27 



50.2 



C. senegalense 
C. beijerincki 



DSM 25507 CAEV01 000001 



NCIMB 

8052 



NC 009617 



3.89 



6.0 



26.8 



29.0 



+ A: Species, Strain, GenBank accession number, genome size and G+C content of all compared genomes 
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Figure 7. Distribution of predicted genes of C. jeddahense and 7 other Clostridium species into COG catego- 
ries. C.jdm= C. jeddahense, C.spo= C. sporosphaeroides, C. Iep= C. leptum, C.bej = C. beijerinckii, C. eel = 
C. cellulolyticum, C. diff = C. difficile, C. sen = C. senegalense, C. dak = C. dakarense. 



Conclusion 

On the basis of phenotypic, phylogenetic and ge- 
nomic analyses (taxono-genomics), we formally 
propose the creation of Clostridium jeddahense sp. 
nov. that contains strain JCD T . This strain was iso- 
lated from the fecal flora of an obese 24 year-old 
Saudian individual living in Jeddah. 

Description of C. jeddahense sp. nov. 

Clostridium jeddahense Qed.dah..en'.se L.gen. neutr. 
n. combination of Jeddah, the city in Saudi Arabia 
where the specimen was obtained from an obese 
Saudian patient sample.) Transparent colonies 
were 0.2 to 0.3 mm in diameter on blood-enriched 
agar. C. jeddahense is a Gram-positive, obligate an- 
aerobic, endospore-forming bacterium with a 
mean diameter of 1 u.m. Optimal growth on axenic 
medium was observed at 37°C. 

C. jeddahense is catalase negative and oxidase neg- 
ative. Alkaline phosphatase, arginine arylamidase, 
proline arylamidase, alanine arylamidase, glycine 
arylamidase, histidine arylamidase, glutamyl glu- 
tamic acid arylamidase and serine arylamidase ac- 



tivities were positive. Arginine dihydrolase, a- 
galactosidase, (3-galactosidase, a-glucosidase, (B- 
glucosidase, a-arabinosidase, N-acetyl-(B- 
glucosaminidase, glutamic acid decarboxylase, a- 
fucosidase, reduction of nitrate, leucyl glycine 
arylamidase, fermentation of mannose and 
raffinose, urease, (3-galactosidase-6-phosphatase, 
(3-glucuronidase, phenylalanine arylamidase, 
leucine arylamidase, pyroglutamic acid 
arylamidase and tyrosine arylamidase activities 
were negative. Asaccharolytic. Positive for indole. 
Cells are susceptible to amoxicillin, amoxicillin- 
clavulanate, imipenem, metronidazole, doxycy- 
cline, rifampicin, vancomycin but resistant to 
ceftriaxone, ciprofloxacin and trimethoprim- 
sulfamethoxazole. 

The G+C content of the genome is 51.95%. The 
16S rDNA and genome sequences are deposited in 
GenBank under accession numbers HG726040 
and CBYL00000000, respectively. The type strain 
is JCDT (= CSUR P693 = DSM 27834). 
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